Multiple sequence alignment with user-defined constraints
نویسندگان
چکیده
In many situations, automated multi-alignment programs are not able to correctly align families of nucleic acid or protein sequences. Difficult cases comprise not only distantly related sequences but also tandem duplications independent of their evolutionary age. Frequently, additional biological information is available that establishes homologies at least in parts of the sequences based on structural or functional consideration. In the present paper, we describe a semi-automatic approach to multiple sequence alignment in which the user can explicitly specify parts of the sequences that are biologically related to each other. Our software program uses these sites as anchor points and creates a multiple alignment that respects these user-defined constraints and hence should be biologically more plausible than alignments produced by fully automated procedures. We apply our approach to genomic sequences adjacent to the Hox genes. As a by-product, we obtain not only useful insights for the further development of alignment algorithms, but also an improved approach to phylogenetic footprinting.
منابع مشابه
Multiple sequence alignment with user-defined constraints at GOBICS
Most multi-alignment methods are fully automated, i.e. they are based on a fixed set of mathematical rules. For various reasons, such methods may fail to produce biologically meaningful alignments. Herein, we describe a semi-automatic approach to multiple sequence alignment where biological expert knowledge can be used to influence the alignment procedure. The user can specify parts of the sequ...
متن کاملMultiple sequence alignment with user-defined constraints @ GOBICS
Summary: Most multi-alignment methods are fully automated, i.e. they are based on a fixed set of mathematical rules. For various reasons, such methods may fail to produce biologically meaningful alignments. Herein, we describe a semi-automatic approach to multiple sequence alignment where biological expert knowledge can be used to influence the alignment procedure. The user can specify parts of...
متن کاملAn Application of the ABS LX Algorithm to Multiple Sequence Alignment
We present an application of ABS algorithms for multiple sequence alignment (MSA). The Markov decision process (MDP) based model leads to a linear programming problem (LPP), whose solution is linked to a suggested alignment. The important features of our work include the facility of alignment of multiple sequences simultaneously and no limit for the length of the sequences. Our goal here is to ...
متن کاملMultiple Sequence Alignments with Regular Expression Constraints on a Cloud Service System
Multiple sequence alignments with constraints are of priority concern in computational biology. Constrained sequence alignment incorporates the domain knowledge of biologists into sequence alignments such that the user-specified residues/segments are aligned together according to the alignment results. A series of constrained multiple sequence alignment tools have been developed in relevant lit...
متن کاملPROMALS3D web server for accurate multiple protein sequence and structure alignments
Multiple sequence alignments are essential in computational sequence and structural analysis, with applications in homology detection, structure modeling, function prediction and phylogenetic analysis. We report PROMALS3D web server for constructing alignments for multiple protein sequences and/or structures using information from available 3D structures, database homologs and predicted seconda...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004